This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents mil not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



THlSt*6E BLW,K(US " 0) 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)' 



3 



(51) International Patent Classification 6 : 

H04J 1/16, H04B 1/44, H04L 12/28, 
12/50, 12/56, H04Q 11/00 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 99/16195 

I April 1999 (01.04.99) 



(21) International Application Number: PCT/US98/20003 

(22) International Filing Date: 24 September 1998 (24.09.98) 



(30) Priority Data: 
08/937,066 



24 September 1997 (24.09.97) US 



(71) Applicant: EMULEX CORPORATION [US/USJ; 3535 Harbor 

Boulevard, Costa Mesa, CA 92626 (US). 

(72) Inventors: ROACH. Bradley; 1 1 1 45th Street, Newport Beach, 

CA 92663 (US). FIACCO. Peter; 3420 Fairmont, Yorba 
Linda. CA 92686 (US). SCHERER, Greg; 19621 Crestknoll 
Drive. Yorba Linda, CA 92886 (US). 

(74) Agent: LAND, John; Fish & Richardson P.C., Suite 1400, 4225 
Executive Square, La Jolla, CA 92037 (US). 



(81) Designated States: CA, JP, KR, European patent (AT, BE. CH f 
CY, DE, DK, ES, FI. FR, GB, GR, IE, IT, LU, MC, NL. 
PT. SE). 



Published 

With international search report. 



(54) Title: FULL-DUPLEX COMMUNICATION PROCESSOR 
(57) Abstract 

A full duplex communication processor (20) simultaneously 
sends and receives frames of data and commands. Separate transmit 
(32) and receive (30) protocol engines are controlled by separate 
sequencers. This enables frames of data to be received and 
transmitted simultaneously without involving the CPU (40) on a 
frame-by-frame basis. 



Cg — 




BNSDOCID: <WO 991619SA1_I_> 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL Albania 

AM Armenia 

AT Austria 

AU Australia 

AZ Azerbaijan 

BA Bosnia and Herzegovina 

BB Barbados 

BE Belgium - 

BK Burkina Faso 

BG Bulgaria 

BJ Benin 

BR Brazil 

BY Belarus 

CA Canada 

CF Central African Republic 

CC Congo 

CH Switzerland 

CI Cdte d'lvoire 

CM Cameroon 

CN China 

CU Cuba 

CZ Czech Republic 

DE Germany 

DK Denmark 

EE Estonia 



ES 


. Spain- - • ' -■.*■■■ 


' LS 


Lesotho 


SI 


Kl 


Finland 


LT 


Lithuania 


SK 


FR 


France ? ■ • 


. * LU 


Luxembourg 


SN 


CA 


Gabon 


LV 


Latvia 


SZ 


GB 


United Kingdom 


MC 


Monaco 


TO 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


GH 


Ghana 


MG 


Madagascar 


TJ 


CN 


Guinea 


MK 


The former Yugoslav 


TM 


GR 


Greece 




Republic of Macedonia 


TR 


HU 


Hungary 


ml ; 


Mali 


TT 


IE 


Ireland 


MN 


Mongolia 


UA 


It. 


Israel 


MR 


Mauritania 


UG 


IS 


Iceland 


MW 


Malawi 


US 


IT 


Italy 


MX 


Mexico 


uz 


JP 


Japan 


NE 


Niger 


VN 


KE 


Kenya 


NL 


Netherlands 


YU 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


KP 


Democratic People's 


NZ 


New Zealand 






Republic of Korea 


. PL 


Poland 




KR 


Republic of Korea 


FT 


Portugal 




KZ 


Kaxaksian 


RO 


Romania 




LC 


Saint Lucia 


RU 


Russian Federation 




LI 


Liechtenstein 


SD 


Sudan 




LK 


Sri l-anka 


SE 


Sweden 




LR 


Liberia 


SG 


Singapore 





Slovenia 

Slovakia 

Senegal 

Swaziland 

Chad 

Togo 

Tajikistan 

Turkmenistan 

Turkey 

Trinidad and Tobago 

Ukraine 

Uganda 

United States of America 

Uzbekistan 

Viei Nam 

Yugoslavia 

Zimbabwe 



BNSOOCID: <WO 9916195A1_L> 



WO 99/16195 



PCT/US98/20003 



- 1 - 

FULL-DUPLEX COMMUNICATION PROCESSOR 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

This invention relates to devices for transferring data in computer networks, and more 
5 particularly to a device for receiving, generating and transmitting frames of data across 
a computer network boundary. 

2. Description of Related Art 

The number of computers and peripherals has mushroomed in recent years. This has 
created a need for improved methods of interconnecting these devices. A wide variety 
1 0 of networking paradigms have been developed to enable different kinds of computers and 
peripheral components to communicate with each other. 

There exists a bottleneck in the speed with which data can be exchanged along such 
networks. This is not surprising because increases in network architecture speeds have 
not kept pace with faster computer processing speeds. The processing power of computer 
15 chips has historically doubled about every 18 months, creating increasingly powerful 
machines and bandwidth hungry applications. It has been estimated that one megabit per 
second of input/output is generally required . per "MIPs" (millions of instructions per 
second) of processing power. With CPUs now easily exceeding 200 MIPs, it is difficult 
for network architecture to keep up with these faster speeds. 

20 Area-wide networks (e.g. , LANs and WANs) and channels are two approaches that have 
been developed for computer network architectures. Traditional networks offer a great 
deal of flexibility and relatively long distance capabilities. Channels, such as Enterprise 
System Connection (ESCON) and Small Computer System interface (SCSI), have been 
developed for high performance and high reliability. Channels typically use dedicated 

25 short-distance connections between computers or between computers and peripherals. 
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Features of both channels and networks have been incorporated into a new network 
standard known as "Fibre Channel". Fibre Channel systems combine the speed and 
reliability of channels with the flexibility and connectivity of networks. Fibre Channel 
products currently can run at very high data rates, such as 266 or 1062 Mbps. These 
5 speeds are sufficient to handle quite demanding applications such as uncompressed, full 
motion, high-quality video. 

There are generally three ways to- deploy Fibre Channel: simple point-to-point 
connections; arbitrated loops; and switched fabrics. The simplest topology is the point-to- 
point configuration, which simply connects any two Fibre Channel systems directly. 
10 Arbitrated loops are Fibre Channel ring connections that provide shared access to 
bandwidth via arbitration. Switched Fibre Channel networks, called "fabrics", yield the 
highest performance by leveraging the benefits of cross-point switching. 

The Fibre Channel fabric works something like a traditional phone system. The fabric 
can connect varied devices such as work stations, PCs, servers, routers, mainframes, and 

15 storage devices that have Fibre Channel interface ports. Each such device can have an 
origination port that "calls" the fabric by entering the address of a destination port in a 
frame header. The Fibre Channel specification defines the structure of this frame. (This 
frame structure raises data transfer issues that will be discussed below and addressed by 
the present invention): The Fibre Channel fabric does all the work of setting up the 

20 desired connection, hence the frame originator does not need to be concerned with 
complex routing algorithms. There are no complicated permanent virtual circuits (PVCs) 
to set up. Fibre Channel fabrics can handle more than 16 million addresses, and so are 
capable of accommodating very large networks. The fabric can be enlarged by simply 
adding ports. The aggregate data rate of a fully configured Fibre Channel network can 

25 be in the tera-bit-per-second range. 

Each of the three basic types of Fibre Channel connections are shown in FIGURE 1, 
which shows a number of ways of using Fibre Channel technology. In particular, point- 
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to-point connections 10 are shown connecting mainframes to each other. A Fibre 
Channel arbitrated loop 1 1 is shown connecting disk storage units. A Fibre Channel 
switch fabric 12 connects work stations 13, mainframes 14, servers 1 5, disk drives 16 and 
local area networks (LANs) 17. The LANs include, for example, Ethernet, Token Ring 
5 and FDDI networks. 

An ANSI specification (X3.230-1994) defines the Fibre Channel network. The 
specification distributes Fibre Channel functions among five layers. As shown in 
FIGURE 2, the five functional layers of the Fibre Channel are: FC-0 - the physical media 
layer; FC-1 - the coding and decoding layer; FC-2 - the actual transport mechanism, 
10 including the framing protocol and flow control between nodes; FC-3 - the common 
services layer; and FC-4 - the upper layer protocol. 

While the Fibre Channel operates at relatively high speed, it would be desirable to 
increase speeds further to meet the needs of faster processors; One way to do this would 
be to eliminate, or reduce, delays that , occur ajt .interface points. One such delay occurs 
during the transfer of a frame from the .EG* J, layer to the FC-2 layer. At this interface, 
devices linked by a Fibre Channel data link receive Fibre Channel frames serially. A 
protocol engine receives these frames and processes them at the next layer, the FC-2 
layer shown in FIGURE 2. The functions of the protocol engine includes validating each 
frame; queuing up DMA operations tp transfer each frame to the host; and building 
transmit frames. 

The high bit speeds of the Fibre Channel data link places extreme demands on the 
protocol engine. Hence, some protocol engines can only operate in half-duplex mode, 
which means that the protocol engine can process data in only one direction at a time. 
This significantly slows down the speed of the data transfer since either the transmit or 
25 the receive task must wait while the other task is performed. 
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Full-duplex protocol engines can process both received and transmitted frames 
simultaneously. Hence full-duplex protocol engines significantly improve data 
throughput However, in full-duplex protocol engines, usually two microprocessors, each 
with local RAM, are required to respectively handle transmit and receive operations. The 
5 use of dual microprocessors for these functions greatly increases the cost of the protocol 
engine. \ „ 

* * i* 

Conventional approaches to handling frames generally rely on the involvement of a host 
GPU on a frame-by-frame basis. For example, validation of received frames and setting 
up direct memory access (DMA) operations and acknowledgments typically involve the 
10 host CPU, which limits frame transmission and reception rates and prevents the host 
CPU from performing other tasks. Further* a host CPU with software protocol "stacks" 
cannot keep up with fast networks such as Fiber Channel. 

In view; of the foregoing,. objects of the invention include: increasing data transfer 
processing speeds in high speed networks such as the Fibre Channel network; providing 
1 5 a technique that can speed up a protocol engine's processing of data frames; providing 
a protocol engine that can perform high-speed full duplex processing of data without 
involving the host CPU on a frame-by-fraihe basis; minimizing data traffic between a 
protocol engine and a host CPU and system memory; and generally offloading protocol 
processing from a host CPU. 

20 SUMMARY OF THE INVENTION 

The invention is directed to the processing and transferring of frames of data in a 
computer data link. The invention is a full-duplex communication processor that uses 
dual micro-coded engines and specialized hardware to build transmit frames, validate 
receive frames, and set up host DMA operations without involving a host CPU and 

25 without one or more resident microprocessors. A preferred embodiment of the invention 
uses independent dedicated transmit and receive protocol processors. These independent 
processors communicate with each other using a transfer ready queue. A context manager 
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provides context information that is used by the receive processor to validate received 
frames and by the transmit processor to build transmit frame headers. 

More particularly, a preferred embodiment of the invention: 1) implements a full-duplex 
communication processor with independent transmit and receive processors that 
5 communicates directly to the host driver software through host memory resident 
"command", "unsolicited data", and "response" rings; 2) establishes communication 
between the dual processors to allow the receive processor to queue work for the transmit 
processor, which allows a remote device send a frame to the receive processor that will 
"wake-up" the transmit processor to send data to the remote device without involving the 
10 host CPU; and 3) establishes an interlocked information table that allows the transmit 
and receive processors to operate on the same Input/Output. (I/O) command. 

In a preferred embodiment, the data channel is a Fibre Channel data link and the full- 
duplex communication processor is configured to process FC^2 protocol Fibre Channel 
frames. ho " rt - c -*! ■ 

- •: v j-r ;_.-::> ;< f \r*-r . '. 

15 The details of the preferred embodiment -ofhthe present rinvention are set forth in the 
accompanying drawings and the description -below;. Once the details of the invention are 
known, numerous additional innovations and changes will become obvious to one skilled 
in the art. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 FIGURE 1 is a block diagram of a : prior* art complex computer network utilizing Fibre 
Channel technology. ^ ; - 

FIGURE 2 is a diagram of the fiver functional layers of the prior art Fibre Channel 
standard. 5 . ■ * 
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FIGURE 3 is a simplified block diagram of a communication processing system in 
accordance with a preferred embodiment of the invention. 

FIGURE 4 is a diagram of a typical prior art Fibre Channel frame of data. 

FIGURE 5 is a simplified block diagram of a full-duplex communication processor in 
5 accordance with a preferred embodiment of the invention. 

FIGURE 6 is a diagram of an Exchange Context Resource Index (XRI) in accordance 
with a preferred embodiment of the "invention. 

Like reference numbers and designations in the various drawings refer to like elements. 



DETAILED DESCRIPTION OF THE INVENTION 

10 The invention includes a fulWuplex cornmuriication processor that improves frame 
transmission and frame reception > Pates in high speed data links such as the Fibre 
Channel. By using independent transmit ; ahd receive' microcoded engines communicating 
directly to host driver software, fullKlupiex network communication is accomplished 
without involving the host CPU. - • - 

15 FIGURE 3 shows a Fibre Channel communication system 20 utilizing the full-duplex 
communication processor 22 in accordance with a preferred embodiment of the 
invention. Serial data is received along a Fibre Channel data link 24. Frames generally 
will comprise three portions, a preamble, a data or "payload" portion, and a trailer 
portion. In a Fibre Channel data link, for example, the Fibre Channel frame consists of 

20 a start of frame (SOF) word (four bytes); a data portion comprising a frame header (six 
bytes), between zero and 2112 payload bytes, and a cyclical redundancy check (CRC) 
word (4 bytes); and an end of frame (EOF) word (4 bytes). The frame header is used to 
control link applications, control device protocol transfers, and detect missing or out of 
order frames. The CRC word indicates whether there is a problem in the transmission, 
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such as a data corruption, or whether some part of the frame was dropped during 
transmission. 

Frames received from the. Fibre Channel data link 24 are processed by an NL port 36 
which decodes and parallelizes the incoming serial data into words. The NL port 36 
5 assembles the words into frames. The NL port 36 also checks the CRC word for each 
frame received and adds a resulting "good-bad" CRC status indicator to other status 
information bits within an EOF status word that is generated from the EOF word. The 
NL port 36 then writes the frames into a receive frame FIFO buffer 28. Further details 
of a preferred FIFO buffer module 28 are described in co-pending patent application 
10 entitled "RECEIVE FRAME FIFO WITH END OF FRAME BYPASS", Serial No. 
08/935,898 filed on September 23, 1997, and assigned to the same assignee of the present 
invention, the disclosure of which is incorporated by reference. 

Fibre Channel frames are then received by the full-duplex communication processor 22. 
also referred to as a protocol engine. Several functions are performed by the full duplex 
1 5 communication processor 22, including: . l r ) r flue t uing up a host command to write the data 
in a received frame into host memory a through direct memory access (DMA); 2) 
validating the frame header to ensure that the frame is the next logical frame that should 
be received; 3) determining whether the frame is defective or not; and 4) generating 
transmit frames in response to a received frame or host-generated transmit command. 

20 Unlike conventional protocol engines, the full-duplex- communieatipn processor 22 does 
not include a microprocessor. Instead, dual microcoded engines are employed in order 
to separate the protocol engine receive tasks from the protocol engine transmit tasks. In 
particular, the full-duplex communication processor 22 includes a receive protocol 
engine 30 and a transmit protocol engine 32. These protocol engines communicate to 

25 each other through a transfer ready queue, 60. The receive protocol engine 30 validates 
the receive frame headers received from the receive frame buffer 28. The transmit 
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protocol engine 32 builds transmit frames and sends them to the Fibre Channel data link 
24 through a transmit FIFO 66 and the NL port 36. 

The fiill duplex communication processor 22 works in conjunction with a host computer 
40 that includes host driver software 38 and host memory 42. In particular, the transmit 
5 and receive protocol engines 30, 32 communicate directly to the host driver software 38. 
Full-duplex communication is achieved because the receive and transmit protocol 
engines operate independently and concurrently. An interlocked context information 
table is used to permit the receive and transmit protocol engines to operate on the same 
I/O command, as described in more detail below. 

1 0 The full-duplex communication processor 22 is able to process frames without involving 
the host CPU on a frame-by-frame basis. For example, one function of the full-duplex 
communication processor 22 is to allow a remote device to send a frame along the Fibre 
Channel link 24 to the receive protocol engine 30 which will "wake up" the transmit 
protocol engine 32 to send data to the-rem6te device through the NL port 36 to the Fibre 

1 5 Channel link 24. Such data may reside^ for example, in the host memory 42. 

FIGURE 5 shows additional details of the full-duplex communication processor 22 of 
a preferred embodiment of the invention. The full-duplex communication processor 22 
includes data structures resident in host memory 42, which may include contiguous and 
non-contiguous physical memory. , • ; . 

20 A Fibre Channel fiame is received by the receive protocol engine 30 through the NL port 
36. An NL port status unit 44 performs the function of timing receive frame sequence and 
monitoring NL port interrupts. The received frame is sent through a sequencer 46 to a 
receive buffer control unit 48 which places the received frame in a receive buffer 50. The 
frame header in the receive buffer 50 is then automatically placed into the receive 

25 protocol engine 30. 
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A lookup field inside each frame header includes a pointer to an associated context. In 
general, the associated context is initialized by the host driver 38 within the host memory 
42, and contains information indicating where to put a particular frame of data in host 
memory 42. More particularly, the context contains fields such as maximum frame size, 
5 current buffer pointer and length, and small computer systems interface (SCSI) state 
information, defined in a list of buffers. 

The host memory unit 42 typically will comprise: many megabytes of memory, and each 
particular frame will fit into one slot in that memory: Each frame header tells the receive 
protocol engine 30 which context to accessor "pull down"Tor that particular frame so 
1 0 that the receive protocol engine can validate that frame . The context is pulled down from 
the host memory 42 under control of the context manager engine 52 through a host 
memory interface 54. The receive protocol engine sequencer.46 then validates the frame. 

Once frame validation is complete, the context pointed to by a frame header will tell the 
receive protocol engine 30 what to do with the-ixame. There are a number of possibilities, 

1 5 including: 1 ) send the frame out the Routing Gontrol/Type ;(R_: CTL/T YPE) ring control 
unit 56 where it then is sent to host memory 42 through the host memory interface 54; 
2) send the frame through the Buffer List: ring -control unit 58 .to one segment in the 
buffer pointer list inside host memory;, and 3 r ) process a nonniata receive frame and 
. associated payload . (For example, the frame may be a communication frame such as a 

20 "transfer ready" that tells the transmitter that the target is now. ready to accept data. This 
would cause the receive frame to pass to the transfer ready queue 60. The transmit 
command would then be sent to the .transmit protocol engine 32). 

The second case involves sending a frame to a buffer pointer list, which is a sequential 
list of buffer descriptors. The first entry in the list contains the total transfer size in bytes. 
25 In the illustrated embodiment, only word; transfers are performed by the full-duplex 
communication processor 22. Hence, if the total transfer size is not an integral number 
of 4-byte words, additional bytes are transferred to the next boundary. Subsequent entries 
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in the buffer list consist of two parts each, one part being a multi-byte address that points 
to the start of a buffer and the other part being the size and usage of the buffer. 

In accordance with the invention, each buffer pointer list includes buffer list modifier 
(BLM) bits that describe the buffer usage and which are used to build an outgoing Fibre 
5 Channel frame header (the FC-2 header) for each transmit frame. The full-duplex 
communication processor 22 must build the frame header and corresponding frame 
control (F_CTL) bits, and transfer the frame header to a transmit FIFO 66 before 
transferring the payload via a DMA operation. The BLM bits and the buffer lengths in 
the buffer lists assist the full-duplex communication processor 22 in determining whether 

1 0 a frame is the last one in a series of frames. For the receive protocol engine, the BLM bits 
control proper placement of received data and status information into the buffer 
segments. The BLM bits are described in related patent application serial number 
08/937,065 entitled "COMMUNICATION' PROCESSOR HAVING BUFFER LIST 
MODIFIER CONTROL BITS", filed on September 24, 1 997, and assigned to the same 

15 assignee as the present invention, the f disclosure of which is incorporated by reference. 

One example of a task performed by the~full-duplex communication processor 22 is the 
processing of a command to write data to a disk drive on the Fibre Channel link 24 from 
a remote device. A write command is sent and the full-duplex communication processor 
transfers the command to the disk drive which sends back a transfer ready message to the 
20 receive protocol engine 30 indicating that the disk drive is ready to accept the data. This 
message goes to the transfer ready queue 60 which instructs the transmit protocol engine 
32 to retrieve the data from host memory 42, generate a frame and transmit the data to 
the disk drive. 

The transmit protocol engine 32 is triggered by either of two events: one is the presence 
25 of an entry in the transfer ready queue 60, and the other is by action of the command ring 
controller 62. An Exchange Context Resource Index (XRI), described below, is used to 
process each command. The command ring is a circular queue of command entries, 
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generally read and write commands. These read and write commands can be used, for 
example, to communicate commands to a remote device such as a disk drive. The size 
and base memory address of the command ring is specified in a command ring base 
register which contains "put" and "get" pointers that are used for managing the command 
5 ring as follows: the host driver 38 manages the put pointer, incrementing the pointer 
whenever a command is queued to the command ring 62. The full-duplex communication 
processor 22 manages the get pointer, incrementing the pointer whenever a command is 
read from the ring. 

A command other than a full-frame transmission provides a pointer to a buffer pointer 
list. The buffer pointer list contains the total transfer size in the first buffer list entry and 
buffer pointer-size pairs in subsequent buffer list entries. The XRI field in the command 
will then be used to instruct the context manager 52 to pull down the appropriate context 
to the transmit protocol engine 32. This transfer, called ;an exchange, tells the transmit 
protocol engine 32 where the engine is in that particular buffer tiflg list, how much data 
the frame has and what stage it is in, etc. Hie context also contains the next frame header. 
The next frame header is initially built by the host driver 38 but thereafter the transmit 
protocol engine 32 builds subsequent frarr^e. headers. The. context manager 52 retrieves 
each frame header from the host memory 42 and passes the header to the transmit header 
controller 68, which sends the frame header to the NL port 36 through the transmit FIFO 
66. < 

Once a frame header is built, the system begins folloyving the buffer list in a process that 
gathers data from host memory. The context for a cpmipand contains a pointer to the 
buffer list. One entry at a time is pulled down from the buffer list by the buffer list ring 
controller 70. The frame header is transferred to the transmit FIFO 66 through a transmit 
25 header control 68. A payload segmenter 12 begins to. pull in payload (frame data) and put 
the payload data into the transmit FIFO 66. Once a frame header and the payload data are 
in the transmit FIFO. 66, the last task is to write an end of frame (EOF) word to the 
transmit FIFO 66. The EOF word is an indication to the NL port 36 to begin transmitting 
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the assembled frame onto the Fiber Channel link 24. Once all frames are sent out 
successfully, a response is generated which is sent to the host driver 38, indicating that 
the frames associated with the pending command were, in fact, sent out successfully. 

Likewise, the receive protocol engine 30 contains an acknowledge FIFO 74 which 
5 generates an acknowledge frame (basically a modified form of the receive frame header) 
that is sent back over the Fibre Channel link 24 to the sender to acknowledge receipt. 

The full-duplex communication processor 22 also includes receive and transmit protocol 
engine registers 76 and 78. These registers contain autonomous protocol management 
functions that are linked and synchronized through the context registers in the context 
10 manager 52. The context manager 52 manages coherency and caching of exchange 
context from the host memory 42, and also synchronizes accesses by the receive and 
transmit protocol engine 30, 32 to the cached exchange context contained in the context 
registers 80. : — f . : ; » i ^ ? . ^ 

In the preferred embodiment, the context manager 52 and the receive and transmit 
1 5 protocol engines 30, 32 communicate with the' host 40 through host memory interface 54 
which includes a peripheral components interface (PCI), direct memory access (DMA) 
controller (not shown), -and a PCI slave interface (not shown). The protocol engine 
registers 76, 78 contain the PCI slave interface and interrupt controller for the protocol 
engines 30, 32. The context manager 52. receive and transmit protocol engines 30, 32 
20 provide status to and froni the protocol engine register 76, 78 for the PCI slave interface 
and interrupt controller. ' - 1 

The receive and transmit protocol engines 30, 32 implement the Fibre Channel protocol 
by using two independent programmable sequencers 46 and 63. The use of sequencers 
46, 63 allows the protocol engine state machine to be implemented in a variable writable 
25 control store RAM, which is downloaded into the receive and transmit protocol engine 
registers 76, 78 during initialization. The host 40 can access this writable control store 
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RAM and can read and write the writable control store RAM through a protocol register 
map. The use of the sequencers adds great flexibility to the protocol engine state machine 
implementation since, by changing code in the writable control store RAM, new or 
different functionality can be downloaded to the full-duplex communication processor 
5 22. 

The full-duplex communication processor 22 can be implemented on a single chip (such 
as an application specific integrated circuit (ASIC)), alone or together with other 
functions. For example, in the illustrated embodiment, the full-duplex communication 
processor 22 can cache one instance of the most recent transmit and receive context. 
1 0 However, by adding additional on-chip memory, additional instances of context can be 
cached. 

XRI Context ; . 

In the preferred embodiment, each context is divided into two host memory structures: 
remote port context and exchange context. An exchange context is contained in an 

15 Exchange Context Resource Index (XRI) .which is used to process a command. In 
particular, an exchange context is a structure that describes a complete exchange or 
controls transmission of one or more sequences;.- The structure is pointed to by an entry 
in an exchange pointer table. An XRI context contains the supporting context needed for 
an operation to take place immediately or through' separate, sequences. The data to send, 

20 or the buffers to receive data, are described by a buffer pointer, list consisting of a set of 
buffer list entries that point to the actual, buffers. As, described above, a buffer list entry 
contains the address and length of a buffer and control bits to indicate sequence initiative, 
end of exchange, end of sequence, etc. For multiple-sequence operations the XRI context 
provides storage for working-register contents. 

25 In the preferred embodiment, the XRI contexts are used by the full-duplex 
communication processor 22 for: Fibre Channel Protocol (FCP) exchanges that it 
originates; transmission exchanges; and temporary purposes to control transmission of 
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a single frame or multiple frame sequences. The XRI context may be used by the host 
driver 38 for keeping track of exchanges for which data is received into buffer ring 
buffers. 

An example of an XRI context is shown in FIGURE 6. The first word is the XRI control- 
5 status word. The XRI control-status word contains configuration fields that are set by the 
host driver. The Total Transfer Size word reflects sequencer activity. For Fibre Channel 
protocol (FCP) originated exchanges, the XRI control-status word shows the remaining 
byte count for write operations and the cumulative received byte count for read 
operations. For transmit sequence 4 commands, the XRI control-status word shows the 
10 remaining byte count if the operation is halted before the complete sequence is 
transmitted. The Rxeng control-status word is used by the receive protocol engine 30 to 
validate frames. The Current Buffer List Address word reflects sequencer activity, as 
does the Current Buffer Offset Address word.^ 

The buffer list modifier (BLM) bits are set from the corresponding bits of a buffer list 
15 entry (BLE) read under sequencer control. The Residual Buffer Length in word five 
reflects sequencer activity. Whenever a sequencer reads a BLE, this field receives the 
buffer word count. Whenever a sequencer issues a DMA operation to transfer data to or 
from the buffer, the word count is reduced by the length of the transfer data. The Current 
Buffer Burst Length word alsoreflects transmit sequencer activity. 

20 The Fibre Channel FC^2 header, in words 7-12, is used to generate header information 
for each frame transmitted by the transmit protocol engine 22. 

Referring again to FIGURE 5, the R_CTL/TYPE ring control 56 controls buffer rings 
that are used to receive all frames except FCP responder frames, i.e., for a locally 
originated FCP exchange. Three R_Cn7TYPE buffer rings assist the host in 
25 demultiplexing incoming frames for the appropriate driver entry points. An 
R_CTL/TYPE buffer ring is a fixed-size sequential list of buffer descriptors. The list is 
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managed by hardware as a logical ring. Buffer descriptors are like buffer list entries in 
a buffer pointer list, but do not contain BLM bits. 

The host driver 42 specifies the location and size of each buffer ring in the corresponding 
base register. Specific registers specify which entries in the R CTL/TYPE buffer rings 
5 are valid. Each register consists of a put pointer and a get pointer. The receive buffers for 
each ring are used in the exact order in which the host driver put the corresponding buffer 
descriptions into the ring. _ 

Loop initialization is initiated locally under host control or remotely by some other port. 
The host driver 38, the transmit and receive sequencers 46, 63,and the NL port 36 logic, 
10 all work together to complete the loop initialization procedure. During this procedure, 
the host driver either originates or passes ,on Fibre Channel Extended Link Services 
(ELS) frames that determine the addresses aijdxap^bUities of the ports on the loop. The 
host driver 38 is responsible for issuing Loop Initialization Select Master (LISM) ELS 
frames which facilitate the loop initialization, process, j : ; . ; - 

1 5 Initialization is needed because both the receive and transmit protocol engines 30, 32 are 
basically two autonomous engines, running 5 in full-duplex ^ and ^ they have minimal 
communication between the two of them. During initialization, the transmit protocol 
engine 32 is turned off and the receive protocol engine 30 is allovyed to receive frames 
and then send them through the transmit protocol engine 32. Thus, the receive protocol 

20 engine takes up "ownership" of the transmit protocol engine hardware and uses that 
hardware to forward frames, in particular, the LISM frames which are transmitted 
utilizing the transmit LISM control module 82. 

i . . * ; 

The full-duplex communication processor of the present invention allows the 
development of a low-cost network interface card that does not require a microprocessor 
25 with local RAM, and further offers the following advantages: 
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• independent transmit and receive processors; 

• the receive processor can queue work for the transmit processor; 

• an interlocked information table allows the transmit and receive engines to 
operate on the same I/O command; 

5 • the transmit and receive processors can follow a single buffer list where the 

host can queue memory buffers that are a different size from the transmit or 
receive frames; 

• . receive frames are validated against the interlocked information table without 

involving the host CPU; u 
10 • acknowledges to receive frames are generated without involving the host 

CPU; . 

• efficient suspend/resume of I/O operations. 



A number of embodiments of the present invention have been described. Nevertheless, 
it will be understood that various modifications may be made without departing from the 
1 5 spirit and scope of the invention; Accordingly/ it is to be understood that the invention 
is not to be limited by the specific illustrated:embodiments, but only by the scope of the 
claims. " 1 
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CLAIMS 

What is claimed is: 

1 . A communication processor comprising: 

(a) a receive processor coupled to a computer network for receiving and 
5 validating frames of data, wherein the frames of data include 

commands; 

(b) a transmit processor coupled to said computer network for 
constructing transmit frames; 

(c) a transfer ready queue coupled between the transmit and receive 
1 0 processors, for transmitting commands in each frame of data having 

commands from the receive processor to the transmit processor; 

(d) an interface circuit for coupling the receive and transmit processors 
and the transfer ready queue with a programmed host computer 
including a CPU anclhpst memory; and 

15 (e) interlocked information table containing context information wherein 

the receive frame processor uses the context information to process 
receive frames without involving the CPU. 

2. A communication processor according to claim 1, wherein the receive 
processor uses the context information to validate receive frames. 



20 3. A communication processor according to claim L wherein the transmit 

processor uses the context information to build transmit frames. 

4. A communication processor according to claims 2 or 3, further comprising 

a context manager for processing the context information, the context 
manager being under the control of host driver software. 
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5. A communication processor according to claim 1 , wherein the transmit and 

receive processors each comprise independent microcoded engines, whereby 
the receive and transmit processors can each simultaneously process 
respective receive and transmit frames. 

5 6. A communication processor according to claim 5, wherein the microcoded 

engines are sequencers, and the communication processor does not include 
a microprocessor. - / . 

7. A communication processor according to claim 1 , wherein the frames of data 

are Fibre Channel frames. > • 

10 8. A communication processor according to claim 7, wherein the receive and 

transmit processors each implement an FC-2 Fibre Channel communications 
protocol. 



9. A communication processor according to claim 1, wherein the receive and 
transmit processors and the transfer ready queue are contained within a single 

15 integrated circuit. 

10. A communication processor according to claim 1, wherein the interface 
circuit includes a direct memory access (DMA) interface. 
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11. A method of processing frames of data in a computer network comprising: 

(a) receiving a first frame of data in a microcoded receive processor 
coupled to the computer network; 

(b) transferring the first frame from the receive processor to a transfer 
ready queue; 

(c) storing contextual information relating to the frame of data in an 
information table; 

(d) using the contextual information to validate the first frame of data; 

(e) transferring the first frame of data from the transfer ready queue to a 
microcoded transmit processor coupled to the computer network; 

(f) constructing a transmit frame using information stored in the first 
frame of data in the transmit processor; and 

(g) transferring the transmit frame to the computer network while 
simultaneously receiving, a i second frame of data in the receive 
processor. 



12. A method according to claim. 1 1, wherein the first and second frames of data 
are Fibre Channel frames ;anduthe,< step of constructing a transmit frame 
includes the step of constructing a Fibre Channel transmitirame. 

13. A method according, to claim. 11, further comprising the step of processing 
the context information in a context manager. -.: ■: - ■ ; • ■ . . : 

14. A method according to claim 1 1 , further comprising the steps of coupling the 
receive and transmit processors to a host computer, and using host driver 
software in the host computer to control the receive and transmit processors. 

15. A method of according to claim 1 4, further comprising the step of storing the 
contextual information in memory located in the host computer. 
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16. A method according to claim 14, further comprising the step of coupling the 
receive and transmit processors to the host computer through a direct memory 
access (DMA) interface.- 

17. A computer network comprising: 

5 (a) source and destination computer devices; 

(b) communication channel coupled to the source and destination 
computer devices; c 

(c) a receive processor coupled to the communication channel for 
receiving and validating frames of data from the source computer 

1 0 device, wherein the frames of data include commands; 

(d) a transmit processor coupled to the computer network for 
constructing transmit. frames; 

(e) a transfer ready queue coupled between the transmit and receive 
processors, for transmitting commands in the frames of data having 

1 5 commands from th& receive processor to the transmit processor; 

(f) a host computer including a CPU, memory and host driver software; 

(g) interface means for coupling the receive and transmit processors and 
the transfer ready queue with the host computer; and 

(h) interlocked information table containing context information wherein 
20 the receive frame processor uses the context information to process 

the receive frames without involving the CPU. 

18. A computer network according to claim 17, wherein the receive frame 
processor uses the context information to validate receive-frames. 

19. A computer network according to claim 17, wherein the transmit frame 
25 processor uses the context information to build transmit frames. 
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20. A computer network according to claim 17, further comprising a context 
manager for processing the context information, the context manager being 
under the control of the host driver software. 

21. A computer network according to claim 17, wherein the frames of data are 
5 Fibre Channel frames. 

22. A computer network according to claim 2 1 , wherein the receive and transmit 
frame processors each implement an FC-2 Fibre Channel communications 
protocol. 

23. A computer network according to claim 17, wherein the interface means 
10 includes a direct memory access (DMA) interface. 

24. A communication processor according to claim,! 7, ; wherein the interlocked 
information table is located in the. host memory 

25. A communication processpri/aecprding.to claim; 17, wherein the context 
information includes remote pQrtcoritext and exchange context. 

15 26. A communication processor^according to : claim; : 17, wherein the frame 

includes a header having lookup .fields . indicating particular context 
information. 
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